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^! . Abstract 

o 

Q\ | Inclusive charm and beauty cross sections are measured in e p and e + p neutral current 

collisions at HERA in the kinematic region of photon virtuality 5 < Q 2 < 2000 GeV 2 and 
Bjorken scaling variable 0.0002 < x < 0.05. The data were collected with the HI detector 
in the years 2006 and 2007 corresponding to an integrated luminosity of 189 pb _1 . The 
$_i 1 numbers of charm and beauty events are determined using variables reconstructed by the 

HI vertex detector including the impact parameter of tracks to the primary vertex and the 
position of the secondary vertex. The measurements are combined with previous data and 
compared to QCD predictions. 
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1 Introduction 



The measurement of the inclusive charm (c) and beauty (b) quark cross sections and the de- 
rived structure functions F 2 CC and in DIS at HERA is an important test of the theory of the 
strong interaction, quantum chromodynamics (QCD), within the Standard Model. These mea- 
surements uniquely constrain the parton density functions (PDFs) of the proton, in particular its 
b and c content. Precise knowledge of the PDFs is for example essential at the Large Hadron 
Collider (LHC). The predictions of the 'standard candle' QCD processes at the LHC, such as 
the inclusive production of W and Z bosons, are sensitive to the theoretical treatment of heavy 
quarks [1-7]. The b quark density is important in Higgs production at the LHC in both the 
Standard Model and in extensions to the Standard Model such as supers ymmetric models at 
high values of the mixing parameter tan (3 [8]. 

This paper reports on measurements made in neutral current deep inelastic scattering (DIS) 
at HERA of the charm and beauty contributions to the inclusive proton structure function F 2 
in the range of virtuality of the exchanged photon 5 < Q 2 < 2000 GeV 2 and Bjorken x 
0.0002 < x < 0.05. The analysis uses the precise spatial information from the HI vertex 
detector to separate events containing c and b flavoured hadrons from light quark events. The 
analysis extends to lower and higher Q 2 than previous HI measurements [9, 10] which used a 
similar technique to the one used in this paper. 

The analysis is based on a dataset with an integrated luminosity of 189 pb" 1 , which is about 
three times greater than in the previous measurements. The data was recorded in the years 
2006 and 2007 with 54 pb" 1 taken in e~p mode and 135 pb -1 in e + p mode. The ep centre of 
mass energy is y/s = 319 GeV, with a proton beam energy of E p = 920 GeV and electron 
beam energy of E e = 27.6 GeV. This dataset is referred to here as HERA II. Many details 
of the analysis are similar to the previous measurements [9, 10], referred to here as HERA I. 
The HERA I and HERA II measurements are combined to produce a complete HERA dataset. 
Measurements of the charm contribution to the proton structure function have also been made 
at HERA using D or D* meson production [11, 12]. There are also measurements of charm and 
beauty in DIS using semi-leptonic decays [13]. 

Events containing heavy quarks are distinguished from those containing only light quarks 
using variables that are sensitive to the longer lifetimes of heavy flavour hadrons. The most 
important of these variables are the transverse displacement of tracks from the primary vertex 
and the reconstructed position of a secondary vertex in the transverse plane. For events with 
three or more tracks in the vertex detector the reconstructed variables are used as input to an 
artificial neural network. This method has better discrimination between c and b compared 
to previous methods [9, 10], which used only the transverse displacement of tracks from the 
primary vertex. Lifetime based methods have the advantage over more exclusive methods, 
such as D* or muon tagging, in that a higher fraction of heavy flavour events may be used, 
although the background from light quark events is larger. The charm structure function F 2 CC 
and the beauty structure function are obtained from the measured c and b cross sections 
after applying small corrections for the longitudinal structure functions F£ c and Fj*. 
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2 Monte Carlo Simulation 



Monte Carlo simulations are used to correct for the effects of the finite detector resolution, 
acceptance and efficiency. The Monte Carlo program RAPGAP [14] is used to generate DIS 
events for the processes ep — > ebbX, ep — > eccX and ep — > eqX where q is a light quark of 
flavour u, d or s. RAPGAP combines 0{a s ) matrix elements with higher order QCD effects 
modelled by parton showers. The heavy flavour event samples are generated according to the 
'massive' photon gluon fusion (PGF) matrix element [15] with the mass of the c and b quarks 
set to m c = 1.5 GeV and m b = 4.75 GeV, respectively. The DIS cross section is calculated us- 
ing the leading order (LO) 3-flavour PDFs from MRST (MRST2004F3LO [16]). The partonic 
system for the generated events is fragmented according to the Lund string model [17] imple- 
mented within the PYTHIA program [18]. The c and b quarks are hadronised according to the 
Bowler fragmentation function [19]. The HERACLES program [20] interfaced to RAPGAP 
calculates single photon radiative emissions off the lepton line, virtual and electroweak correc- 
tions. The Monte Carlo program PHOJET [21] is used to simulate the background contribution 
from photoproduction ^p — ► X. 

The samples of generated events are passed through a detailed simulation of the detector 
response based on the GEANT3 program [22], and through the same reconstruction software as 
is used for the data. 



3 HI Detector 

Only a short description of the HI detector is given here; a more complete description may be 
found in [23]. A right handed coordinate system is employed with the origin at the position 
of the nominal interaction point that has its Z-axis pointing in the proton beam, or forward, 
direction and X (Y) pointing in the horizontal (vertical) direction. The pseudorapidity is related 
to the polar angle 9 by rj = — lntan(0/2). 

Charged particles are measured in the central tracking detector (CTD). This device consists 
of two cylindrical drift chambers interspersed with Z-chambers to improve the Z-coordinate 
reconstruction and multi-wire proportional chambers mainly used for triggering. The CTD is 
operated in a uniform solenoidal 1.16 T magnetic field, enabling the momentum measurement 
of charged particles over the polar angular range 20° < 9 < 160°. 

The CTD tracks are linked to hits in the vertex detector, the central silicon tracker CST [24], 
to provide precise spatial track reconstruction. The CST consists of two layers of double-sided 
silicon strip detectors surrounding the beam pipe, covering an angular range of 30° < 9 < 150° 
for tracks passing through both layers. The information on the Z-coordinate of the CST tracks is 
not used in the analysis presented in this paper. For CTD tracks with CST hits in both layers the 
transverse distance of closest approach (DCA) to the nominal vertex in X-Y, averaged over the 
azimuthal angle, is measured to have a resolution of 43 /im©51 fxm/(P T [GeV}) where Pt is the 
transverse momentum of the particle. The first term represents the intrinsic resolution (including 
alignment uncertainty) and the second term is the contribution from multiple scattering in the 
beam pipe and the CST. 
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The track detectors are surrounded in the forward and central directions (4° < 6 < 155°) 
by a fine-grained liquid argon calorimeter (LAr) and in the backward region (153° < 9 < 178°) 
by a lead-scintillating fibre calorimeter (SPACAL) [25] with electromagnetic and hadronic sec- 
tions. These calorimeters are used in this analysis to measure and identify the scattered electron^ 
and also provide energy and angular reconstruction for final state particles from the hadronic 
system. 

Electromagnetic calorimeters situated downstream in the electron beam direction allow de- 
tection of photons and electrons scattered at very low Q 2 . The luminosity is measured with 
these calorimeters from the rate of photons produced in the Bethe-Heitler process ep — > epy. 



4 Experimental Method 
4.1 DIS Event Selection 

The events are triggered by requiring a compact, isolated electromagnetic cluster in either the 
LAr or SPACAL calorimeters with an overall trigger efficiency of almost 100%. The electro- 
magnetic cluster with the highest transverse energy, which also passes stricter offline criteria is 
taken as the scattered electron. The Z-position of the interaction vertex, reconstructed by one 
or more charged tracks in the tracking detectors, must be within ±20 cm to match the accep- 
tance of the CST The interaction vertex approximately follows a Gaussian distribution with a 
standard deviation of 13 cm. 

Photoproduction events and DIS events with a hard photon radiated from the initial state 
electron are suppressed by requiring ~ Pz,i) > 35 GeV. Here, Ei and p Zii denote the 

energy and longitudinal momentum components of a particle and the sum is over all final state 
particles including the scattered electron and the hadronic final state (HFS). The HFS particles 
are reconstructed using a combination of tracks and calorimeter deposits in an energy flow 
algorithm that avoids double counting [26] . 

The event kinematics, including the photon virtuality Q 2 , the Bjorken scaling variable x and 
the inelasticity variable y, are reconstructed with the 'eS' method [27], which uses the scattered 
electron and the HFS. The variables obey the relation x = Q 2 /sy. In order to have good 
acceptance in the SPACAL and to ensure that the HFS has a significant transverse momentum, 
events are selected for Q 2 > 4.5 GeV 2 . The analysis is restricted to y > 0.07 in order to ensure 
that the direction of the quark which is struck by the photon is mostly in the CST angular range. 
An upper y cut is applied that varies from 0.5 at low Q 2 to 0.85 at high Q 2 in order to suppress 
photoproduction background. The measurement is made differentially by dividing the data into 
discrete y-Q 2 intervals. This binning scheme is preferable to one using x-Q 2 boundaries as it 
avoids event losses near the cuts on y. The RAPGAP Monte Carlo program is used to estimate 
the acceptance of c (b) events that have a c (b) quark with tranverse momentum greater than 
0.3 GeV and within the angular acceptance of the CST. The overall c (6) quark acceptance is 
89% (90%) with a minimum of 75% (75%) in any y-Q 2 bin. 

'in this paper we use 'electron' to also denote 'positron' unless explicitly stated. 
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The position of the beam interaction region in X-Y (beam spot) is calculated from infor- 
mation of tracks with CST hits and updated regularly to account for drifts during beam storage. 
The beam interaction region has an elliptical shape with a size of around 90 /xm in x and around 
22 fim in Y. 

4.2 Track Selection 

The impact parameter 5 of a track, which is the transverse DCA of the track to the primary 
vertex point (see figured]), is only determined for those tracks which are measured in the CTD 
and have at least two CST hits linked (referred to as CST tracks). The beam spot is used as 
the position of the primary vertex. CST tracks are required to have a transverse momentum 

P^ rack > 0.3 GeV. 

The direction of the struck quark is used in the determination of the sign of 5. The vector 
of the struck quark (P^, 9 q , <p q ) is reconstructed as the azimuthal angle of the highest transverse 
momentum jet in the event. If there is no jet reconstructed in the event the vector is reconstructed 
from the electron [28] so that P| = P|, lec , cos 9 = l-SE 2 y 2 /(AE 2 y 2 +Q 2 (l-y)), <p q = 180°- 
elec , where P|. lec and elec are the transverse momentum and the azimuthal angle in degrees of 
the scattered electron, respectively. Jets are reconstructed using the inclusive longitudinally 
invariant k T algorithm with massless P T recombination scheme and distance parameter R = 1 
in the rj — <p plane [29]. The algorithm is run in the laboratory frame using all reconstructed 
HFS particles and the resultant jets are required to have transverse momentum greater than 1.5 
GeV and to be in the angular range 15° < 9 < 155°. Approximately 95% (99%) of c (b) events 
have 4> q reconstructed from a jet, as determined from the Monte Carlo simulation. Tracks with 
azimuthal angle (/> tr ack outside ±90° of (f> q are assumed not to be associated to the struck quark 
and rejected. 

If the angle a between <f) q and the line joining the primary vertex to the point of DCA is less 
than 90°, S is defined as positive. It is defined as negative otherwise. FigureQ] shows an example 
of a track with positive 5 and one with negative 5. The 5 distribution, shown in figure [2] is seen 
to be asymmetric with positive values in excess of negative values indicating the presence of 
long-lived particles. It is found to be well described by the Monte Carlo simulation. CST tracks 
with \5\ > 0.1 cm are rejected to suppress light quark events containing long-lived strange 
particles. 

The number of reconstructed CST tracks iV tra ck associated to the struck quark is an important 
quantity since for higher track multiplicities more information can be used. In the kinematic 
range of this measurement 22% of the events have no selected track, 26% of the events have 

iV track = 1, 23% have iV track = 2 and 29% have iV track > 3. 

4.3 Secondary Vertex Reconstruction 

The complete set of reconstructed tracks in the event is used to simultaneously reconstruct a 
secondary and primary vertex in the transverse plane. Two vertices are reconstructed in each 
event even if they are not statistically separable. Each track is assigned a weight for each 
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vertex, which is a measure of the probability that the track originated at that vertex [30]. In this 
approach tracks are not assigned unambiguously to one vertex or the other. A simultaneous fit to 
the primary and secondary vertex is made, with the weights of all tracks of the event considered 
for the primary vertex, but only the weights of CST tracks considered for the secondary vertex. 
The beam spot together with its spread is used as an additional constraint to the primary vertex. 
The vertex configuration that minimises the global \ 2 is found iteratively using deterministic 
annealing [31]. 

The transverse distance between the secondary and primary vertex is defined as L xy . The 
secondary vertex significance Sl is L xy /a(L xy ), where o{L xy ) is the uncertainty on L xy . If the 
absolute difference between the azimuthal angle of the secondary vertex (taking the primary 
vertex as the origin) and <\fl is less than 90°, Sl is signed positive and negative otherwise. A 
measure of the decay multiplicity N^[ ck is made by counting the number of tracks with weight 
greater than 0.8 to the secondary vertex. This method was shown in [9] to yield consistent 
results with the default method that used track significances. 

4.4 Quark Flavour Separation 

The track significance is defined as 8/a{5), where a (5) is the uncertainty on 5. The significances 
Si, S 2 an d S3, are defined as the significance of the CST track with the highest, second highest 
and third highest absolute significance, respectively. The significances take the sign of S (see 
section |4~2|) . Tracks with a negative sign for significance are likely not to arise from particles 
with a large lifetime and are used in this analysis to estimate the light quark contribution. 

Tracks that do not have the same significance sign as Si are ignored. The Si distribution 
is used for events with one remaining CST track after this selection and the S 2 distribution is 
used if there are two remaining CST tracks. For events with three or more remaining CST 
tracks, where more information is available, information is combined from the significance 
distributions and the reconstructed secondary vertex using an artificial neural network (NN) 
that takes into account correlations of the input variables [32]. In this way each event with 
at least 1 CST track appears in exactly one distribution and the distributions are statistically 
independent. The Si and S2 distributions are shown in figure[3l 

The NN has inputs of S u S 2 , S 3 , S L , N tTack , N^ ck and P£ rack of the CST tracks with the 
highest and second highest transverse momentum. The NN has one hidden layer with 5 nodes. 
It was trained using a sample of around 5000 Monte Carlo b events as 'signal' and a similar 
number of Monte Carlo c events as 'background' . No attempt is made to discriminate against 
the light quark events since their impact is minimized by the subtraction procedure described 
below. The same NN is used for all y-Q 2 bins. The distributions of the NN inputs are shown 
in figures @] and |5j These distributions are dominated by light quark events. It can be seen 
that the Monte Carlo simulation gives a reasonable description of these distributions. It is also 
apparent that these distributions have good separation power between light, c and b events. The 
decrease in events around zero for the Si and S 2 distributions is due to the requirement that 
I Si I > I S 2 1 > I S 3 1 . In order to see how well the Monte Carlo simulation describes the heavy 
flavour contribution of the NN input distributions, the distribution for those events with S 2 > 3 
is taken and the distribution for those events with S 2 < —3 is subtracted from it. This has the 
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effect of greatly reducing the light quark distribution which is almost symmetric in S 2 . This 
subtraction method can be used for any distribution. The subtracted distributions of the NN 
inputs are shown in figures [6] and |7J The Monte Carlo simulation gives a good description of 
these distributions. It can be seen that b events tend to have a higher track multiplicity and more 
tracks at higher P T . Other distributions are checked in a similar way. Distributions of variables 
related to the struck quark, P^ and 7] q , as well as the kinematic variables logQ 2 and logx are 
shown in figure [8] for iV track > 2. The Monte Carlo simulation gives a good description of these 
distributions. 

The output of the NN is shown in figure It gives output values in the range from about 
0.2 to 0.95. The NN output is signed according to the sign of Si. It can be seen that: the light 
quark distribution is approximately symmetric and peaks towards low absolute values; the c and 
b distributions are asymmetric with more positive than negative entries; the b events are peaked 
towards 1, whereas the c events are peaked towards 0. The Monte Carlo simulation gives a good 
description of the distribution. 

Since the light quark Si, S 2 and NN distributions are nearly symmetric around zero the 
sensitivity to the modelling of the light quarks can be reduced by subtracting the contents of the 
negative bins from the contents of the corresponding positive bins. The subtracted distributions 
are shown in figure [TOl The resulting distributions are dominated by c quark events, with a b 
quark fraction increasing towards the upper end of the distributions. The light quarks contribute 
a small fraction, although this fraction is larger than in [10], mainly due to the lower Q 2 and 
pfa-ack se i ec tions in the present analysis. 

The fractions of c, b and light quarks of the data are extracted in each y-Q 2 interval using 
a least squares simultaneous fit to the subtracted Si, S 2 and NN distributions (as in figure [TOl 
and the total number of inclusive events before any CST track selection. Only those bins in 
the significance distributions which have at least 25 events before subtraction are considered in 
the fit, since Gaussian errors are assumed. The last fitted bin of the significance distributions, 
which usually has the lowest statistics, is made 3 times as wide as the other bins (see figure [TOl). 
If any of the bins before subtraction within the NN output range contain less than 25 events the 
bin size is doubled, which ensures all bins contain at least 25 events. 

The light, c and b Monte Carlo simulation samples are used as templates. The Monte Carlo 
light, c and b contributions in each y-Q 2 interval are scaled by factors pi, p c and p&, respectively, 
to give the best fit to the observed subtracted Si, S 2 and NN distributions and the total number of 
inclusive events in each y — Q 2 interval. Only the statistical errors of the data and Monte Carlo 
simulation are considered in the fit. The fit to the subtracted significance and NN distributions 
mainly constrains p c and p&, whereas the overall normalisation constrains p\. 

Since the error on p c is much smaller than that of pi, the c cross section is measured in more 
y-Q 2 intervals than the b cross section. Therefore two sets of fits are performed, one with a 
fine binning of 29 bins and the other with a coarse binning of 12 bins. The two sets of fits are 
performed in an identical manner. The results of the two sets of fits are listed in tables [TJ and [2] 
Also included in the tables are the x 2 /n.d.f. evaluated using statistical errors only. Acceptable 
values are obtained for all bins. The fit is also performed to the complete data sample and shown 
in figure \W[ The stability of the method is checked by repeating the fit to the complete data in 
a variety of ways: fitting the e + p and e~p data separately; using S 3 instead of the NN; using 
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Sl instead of the NN; using the NN alone without Si and S 2 ; using Si and S 2 without the NN; 
using the NN alone without negative subtraction. All give consistent results within statistical 
and systematic errors. 

The results of the fit in each y-Q 2 interval are converted to a measurement of the 'reduced 
c cross section' defined from the differential cross section as 

o*t x &) = -^1 ^ (1) 

using: 

p c N MCsen 

a cc (x, Q 2 ) = a{x, Q 2 ) \ r MCgon , ^Mci^^cc, (2) 

piN l + p c N c 8 + p b N b 

where a is the fine structure constant evaluated with scale Q 2 , and A r i MCgcn , N^ Cscn , and 
are the number of light, c, and b quark events generated from the Monte Carlo simula- 
tion in each bin. The inclusive reduced cross section a(x, Q 2 ) is taken from HI measurements: 
Tables 17 and 19 from [33], Tables 10 and 11 from [34] and Table 11 from [35]. A bin centre 
correction 5-qcc is applied using a NLO QCD expectation for a cc and a to convert the bin av- 
eraged measurement into a measurement at a given x-Q 2 point. This NLO QCD expectation 
is calculated from the results of a fit similar to that performed in [36] but using the massive 
scheme to generate heavy flavours. 

The reduced cross section is corrected using the Monte Carlo simulation for pure QED 
radiative effects. The photoproduction background is not subtracted, which, due to the method 
used to calculate the reduced cross sections, means that the fraction of c and b events in the 
photoproduction background is assumed to be the same as in the DIS data. In most of the y 
range the background from photoproduction events is negligible and does not exceed 4% in any 
y-Q 2 interval used in this analysis. Events that contain c hadrons via the decay of b hadrons 
are not included in the definition of the reduced c cross section. The reduced b cross section is 
evaluated in the same manner. 



5 Systematic Uncertainties 

The following uncertainties are taken into account in order to evaluate the systematic error. 

• An uncertainty in the 5 resolution of the tracks is estimated by varying the resolution by 
an amount that encompasses any difference between the data and simulation evaluated 
from figure [2] This was achieved by applying an additional Gaussian smearing in the 
Monte Carlo simulation of 200 /im to 5% of randomly selected tracks and 12 pm to the 
rest. 

• A track efficiency uncertainty is assigned of 1% due to the CTD and of 2% due to the 
CST. 
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The uncertainties on the various D and B meson lifetimes, decay branching fractions and 
mean charge multiplicities are estimated by varying the input values of the Monte Carlo 
simulation by the errors on the world average measurements. For the branching fractions 
of b quarks to hadrons and the lifetimes of the D and B mesons the central values and 
errors on the world averages are taken from [37]. For the branching fractions of c quarks 
to hadrons the values and uncertainties are taken from [38], which are consistent with 
measurements made in DIS at HERA [39]. For the mean charged track multiplicities the 
values and uncertainties for c and b quarks are taken from Marklll [40] and LEP/SLD [41] 
measurements, respectively. 

The uncertainty on the fragmentation function of the heavy quarks is estimated by re- 
weighting the events according to the longitudinal string momentum fraction z carried by 
the heavy hadron in the Lund model using weights of (1 =F 0.7) • (1 — z) + z ■ (1 ± 0.7) for 
charm quarks and of (1 =F 0.5) • (1 — z) + z ■ (1 ± 0.5) for beauty quarks. The variations 
for the charm fragmentation are motivated by comparison of the Monte Carlo simulation 
with HI data [42]. 

An uncertainty on the QCD model of heavy quark production is estimated by reweight- 
ing the jet transverse momentum and pseudorapidity by (P^ ct /(10 GeV)) ±0,2 and (1 ± 
V ct ) ±0 - 15 for charm jets and (P^ ct /(10 GeV)) ±a3 and (li^io.a for bottom jets. These 
values are obtained by comparing these variations with the measured reduced cross sec- 
tion for b and c jets [43]. The effects of each of these uncertainties on the subtracted 
reconstructed distributions of and r] q are shown in figure [8l where the data is seen to 
be consistent with the Monte Carlo simulation within the uncertainties. 

The uncertainty on the asymmetry of the light quark 5 distribution is estimated by repeat- 
ing the fits with the subtracted light quark distributions (figure [101) changed by ±30%. 
The light quark asymmetry was checked to be within this uncertainty by comparing the 
asymmetry of Monte Carlo simulation events to that of the data for K° candidates, in the 
region 0.1 < \5\ < 0.5 cm, where the light quark asymmetry is enhanced. 

An error on <\fl is estimated by shifting <p q by 2°(5°) for events with (without) a recon- 
structed jet. These shifts were estimated by comparing the difference between <\> q and the 
track azimuthal angle in data and Monte Carlo simulation. 

The uncertainty on the hadronic energy scale is estimated by changing the hadronic en- 
ergy by ±2%. 

The uncertainty in the photoproduction background is taken as 100% of the fraction of 
photoproduction events in each bin, for events with Arrack > 1. 

Uncertainties on the acceptance and bin centre correction due to the input heavy quark 
structure functions used are estimated by reweighting the input a cc distribution by x ±0 1 
and 1 ± 0.21n[Q 2 /(10 GeV 2 )] and a bl by x ±03 and 1 ± 0.4m[Q 2 /(10 GeV 2 )]. The 
range of variation of the input structure functions was estimated by comparing to the 
measured values obtained in this analysis. The effects of each of these uncertainties on 
the subtracted distributions of log Q 2 and log x are shown in figure [H where the data is 
seen to be consistent with the Monte Carlo simulation within the uncertainties. 
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The above systematic uncertainties are evaluated by making the changes described above 
to the Monte Carlo simulation and repeating the procedure to evaluate the reduced c and b 
cross sections, including the fits. The uncertainties are evaluated separately for each x-Q 2 
measurement. 

Additional contributions to the systematic error due to the inclusive cross section are taken 
from the corresponding x-Q 2 bin in [33-35], since the measurements are normalised to the 
inclusive cross section measurements (see equation |2]). The error due to the inclusive DIS 
selection includes a 1.1-1.5% uncertainty on the luminosity measurement; an uncertainty on the 
scattered electron polar angle of 0.2-3.0 mrad and energy of 0.2-2.0% depending on the energy 
and angle; typically < 1% combined error due to trigger and scattered electron identification 
efficiency; and a 0.5-1.0% uncertainty on the reduced cross section evaluation due to QED 
radiative corrections. 

A detailed list of the systematic effect on each reduced cross section measurement is given 
in tables [3] and @] The errors of 5 resolution and track efficiency are considered uncorrected 
between the HERA I and HERA II datasets. All other errors are assumed 100% correlated. 



6 Results 



6.1 Comparison and Combination of Data 

The measurements of a cc and a bb are shown as a function of x for fixed values of Q 2 in fig- 
ures [TT] and [T2] respectively. Also shown in these figures are the HERA I data extracted using 
measurements based on the displacement of tracks [9, 10]. The HERA I measurements for 
Q 2 < 60 GeV 2 are normalised to the HI inclusive cross sections measurements from [33] 
and [34]. The a cc and a bb data from HERA I and HERA II show good agreement for all mea- 
sured x and Q 2 values. In figure [TT] the a cc data are also compared with those extracted from 
D* meson measurements by HI [12], which were obtained using a NLO program [44] based on 
DGLAP evolution to extrapolate the measurements outside the visible D* range. The D* data 
agree well with the measurements from the present analysis. 

The HERA I and HERA II datasets are combined for each x-Q 2 point where there are 
two measurements. The combination is performed using a weighted mean using those errors 
considered uncorrelated between the two data sets (the statistical errors and the systematic errors 
of S resolution and track efficiency, see section [5]): 

fcomb = w T aj + wna u , (3) 



S 2 b 2 
n a °i 

«T]2 aIld W H = 72— A 2 



with wj = r9 11 r9 and w n = 2 1 2 , (4) 



where cr com b is the combined measurement and a\ and an are the HERA I and HERA II mea- 
surements respectively, with their respective errors of 5j = Yli and = Yli ^n> where i 
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denotes the statistical error and each source of uncorrelated error between the two data sets. 
These contributions to the combined error are evaluated as: 

(Cnb) 2 = M) 2 + KA\) 2 - (5) 

For those systematics j considered correlated between the two datasets (all apart from the errors 
of S resolution and track efficiency) the systematic error as evaluated from HERA II is taken, 
The total error on the combined measurement is therefore evaluated from: 

(Cnb) 2 = E^omb) 2 + EWl) 2 - 

i j 

All data that is subsequently shown is from the combined dataset. Tables [3] and @] list the 
combined results for c and b. The results listed in these tables supersede the HERA I mea- 
surements published in [9] and [10]. If fits are performed with the data listed in these tables it 
should be noted that, although each x-Q 2 measurement is statistically independent, there is a 
correlation between the c and b measurements. Since the c and b binning is very different this 
correlation may be neglected for most bins. However, for the first and last bin the binning is 
identical so the correlation coefficient C bc listed in table \T\ should be used. 



6.2 Comparison with QCD 

The leading contribution to heavy flavour production in the region Q 2 <M 2 is given by the mas- 
sive boson-gluon fusion matrix element [15,45] convoluted with the gluon density of the proton. 
In the region where Q 2 is much larger than M 2 the massive approach may be a poor approxima- 
tion due to the large logarithms log Q 2 /M 2 which are not resummed [2]. Here, the heavy quarks 
can be treated as massless partons with the leading order contribution coming from the quark 
parton model and the heavy quark parton densities. In QCD fits to global hard- scattering data 
the parton density functions are usually extracted using the general mass variable flavour num- 
ber scheme (GM VFNS) [1-4] for heavy quarks which interpolates from the massive approach 
at low scales to the massless approach at high scales. 

The data are compared with recent QCD predictions based on the GM VFNS from MSTW 
[7] (at NLO and NNLO), CTEQ [6] (at NLO) and from HI [34] (at NLO). The MSTW pre- 
dictions use the MSTW08 PDFs which have m c = 1.4 GeV and m b = 4.75 GeV, and the 
renormalization and factorization scales are set to fj, r — fj,f — Q. The CTEQ predictions use the 
CTEQ6.6 PDFs where m c = 1.3 GeV, m b = 4.5 GeV and /i r = fi f = ^Q 2 + M 2 . The pre- 
dictions from HI use the HI PDF 2009 PDFs and the same heavy flavour treatment, including 
the quark masses and perturbative scales, as for the MSTW08 NLO predictions [7]. 

The data are also compared with predictions based on CCFM [46] parton evolution and 
massive heavy flavour production. The CCFM predictions use the AO PDF set [47] with m c = 
1.4 GeV, m b = 4.75 GeV and /i r = [i$ = + Q\, where s is the square of the partonic 
centre of mass energy and Qt is the transverse momentum of the heavy quark pair. 

The a cc data as a function of x for fixed values of Q 2 are compared in figure[T3]wifh the QCD 
predictions from CCFM, CTEQ and MSTW at NNLO, and in figure Q4] with the predictions 
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from HI and MSTW at NLO. In figure US the GM VFNS predictions from CTEQ and MSTW 
at NNLO are observed to be similar with the size of the largest differences between the two 
being at the level of the total experimental errors on the data. The CTEQ and MSTW predictions 
provide a reasonable description of the rise of the data with decreasing x across the whole of the 
measured kinematic range thus supporting the validity of PDFs extracted using the GM VFNS. 
The predictions based on CCFM evolution tend to undershoot the data at the lowest values of 
Q 2 and x but also provide a reasonable description for the rest of the measured phase space. 
In figure [H] the GM VFNS predictions from HI and MSTW at NLO, which implement the 
same heavy flavour treatment [7], are similar and also provide a reasonable description of the 
data. The HI predictions are shown with uncertainty bands representing the experimental and 
theoretical uncertainties [34]. The inner error band describes the experimental fit uncertainty, 
the middle error band represents the experimental and model uncertainties added in quadrature 
and the outer error band represents the fit parameterisation uncertainty added in quadrature with 
all the other uncertainties. The largest contribution to the uncertainty comes from the model 
which is dominated by the variation of the charm quark mass, which is varied from 1.38 to 
1.47 GeV. The total uncertainties on the H1PDF 2009 reduced charm cross section predictions 
are generally smaller than those on the data. 

The a bb data as a function of x for fixed values of Q 2 are compared with the QCD predic- 
tions in figures \\5\ and [16l In figure [15] the CTEQ and CCFM predictions are seen to be very 
similar across the whole range of the measurements. The MSTW NNLO predictions are around 
35% higher than CTEQ and CCFM at low values of Q 2 , with the difference decreasing with 
increasing values of Q 2 . The differences between the theory predictions for the b cross section 
at low Q 2 are much reduced compared with the theoretical status at the time of the HERA I pub- 
lication where there was a factor 2 difference at Q 2 = 12 GeV 2 [10]. In figure [J6]the MSTW 
and HI NLO QCD predictions for a bb are observed, as for the case of a cc , to be very similar. 
The uncertainty on the HI predictions is again dominated by the model uncertainty due to the 
variation of the b quark mass, which is varied from 4.3 to 5.0 GeV. At lower values of Q 2 the 
uncertainties on the HI PDF predictions are larger than those on the data. The reduced b cross 
section data, including the points in the newly measured regions, are well described by all the 
present QCD predictions. 

The structure function F| c is evaluated from the reduced cross section 

yg = J ^- l + ( f_ y)a ^ W 

where the longitudinal structure function is estimated from the same NLO QCD expectation 
as used for the bin centre correction. The correction due to F£ c is negligible for most bins but 
contributes up to 6.7% of the reduced cross section at the highest value of y. The structure 
function F^ b is evaluated in the same manner. 

The measurements of F 2 CC and F 2 bb are presented in table [3] and shown as a function of 
Q 2 in figure [FT] and figure [T8] respectively. The data are compared with the GM VFNS QCD 
predictions from CTEQ [6] at NLO and from MSTW at NLO and NNLO [7] . The description of 
the charm data by the MSTW QCD calculations is reasonable, with the NNLO being somewhat 
better than NLO. The CTEQ NLO prediction also gives a reasonable description of the data. 
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The measurements are presented in figure [T9]in the form of the fractional contribution to the 
total ep cross section 

d 2 cc d 2 

pcc _ / _L_ (0\ 

J dxdg 2/ dxdg 2 ' v ' 

The 6 fraction / 6b is defined in the same manner. In the present kinematic range the value 
of f cc is around 17% on average and increases slightly with increasing Q 2 and decreasing x. 
The value of f bb increases rapidly with Q 2 from about 0.2% at Q 2 = 5 GeV 2 to around 1% for 
Q 2 >Q0 GeV 2 . The NNLO QCD predictions of MSTW shown in figure[19]are found to describe 
the data well. 



7 Conclusion 

The reduced charm and beauty cross sections in deep inelastic scattering are measured for a 
wide range of Q 2 and Bjorken x using the HERA II data. The analysis was performed using 
several variables including the significance (the impact parameter divided by its error) and the 
position of the secondary vertex as reconstructed from the vertex detector. For selected track 
multiplicities of 1 or 2 the highest and second highest significance distributions are used to 
evaluate the charm and beauty content of the data. For selected track multiplicities > 3 several 
variables are combined using an artificial neural network. 

The reduced cross sections agree with previous measurements using a similar technique, 
but have reduced errors and cover an extended Q 2 range. HERA I and HERA II data are 
combined resulting in more precise reduced cross section and structure function measurements. 
The charm and beauty fractional contributions to the total ep cross section are also measured. In 
this kinematic range the charm cross section contributes on average 17% and the beauty fraction 
increases from about 0.2% at Q 2 = 5 GeV 2 to 1.0% for Q 2 >60 GeV 2 . The measurements are 
described by predictions using perturbative QCD in the general mass variable flavour number 
scheme at NLO and NNLO. 
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Table 1: The fit parameters pi, p c and p&, along with their errors, the x 2 P er degree of freedom 
and the correlation coefficients of the fit parameters for each bin in Q 2 and x. The parameters 
are shown for the fine binning scheme used to evaluate the reduced c cross section. 
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Table 2: The fit parameters pi, p c and p b , along with their errors, the x 2 P er degree of freedom 
and the correlation coefficients of the fit parameters for each bin in Q 2 and x. The parameters 
are shown for the coarse binning scheme used to evaluate the reduced b cross section. 
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Table 3: The measured values and relative errors for the reduced DIS cross section (<t 9 ^) and structure function F^ q for charm (c) and 
beauty (b) quarks for the combined datasets. The values for F!f are obtained from the measured reduced cross sections using the NLO 
QCD fit to correct for the contributions from Ff}. The table shows the statistical error (5 sta t) 5 the systematic error (<5 sys ), the total error 
(#tot) and the uncorrelated systematic error (5 unc ) on the reduced cross section. The next ten columns show the effect of a +lcr shift for the 
correlated systematic error contributions to the reduced cross section from: track impact parameter resolution, CJC track efficiency, CST 
track efficiency, c fragmentation, b fragmentation, light quark contribution, struck quark angle 9 , hadronic energy scale, photoproduction 
background and the DIS event selection. The —la errors are taken as the negative of the +la errors. The correlated systematic errors are 
continued in tabled 
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Table 4: The correlated errors continued from tabled The first eight errors represent a +ler shift for the correlated systematic error 
contributions from: reweighting the Q 2 distribution, reweighting the x distribution, reweighting the jet transverse momentum distribution 
, and reweighting the jet pseudorapidity r]i ct distribution for c and b events. The remaining six columns show the contributions from: c 
hadron branching fractions and multiplicities, and the b quark multiplicity. Only those uncertainties where there is an effect of > 1% for any 
x-Q 2 point are listed; the remaining uncertainties are included in the uncorrelated error. 




Figure 1: Diagrams of a track in the X-Y plane. If the angle a is less than 90°, 5 is defined as 
positive otherwise 5 is defined as negative. 
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Figure 2: The distribution of the signed impact parameter 5 of a track to the primary vertex 
in the X-Y plane. Included in the figure is the expectation from the Monte Carlo simulation 
for light, c and b quarks. The contributions from the various quark flavours in the Monte Carlo 
are shown after applying the scale factors pi, p c and pb obtained from the fit to the complete 
data sample (see section |4~4"1) . The arrows indicate the range over which tracks are selected for 
analysis. 
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Figure 3: The significance 5 / a{5) distribution (a) of the highest absolute significance track (Si) 
and (b) of the track with the second highest absolute significance (S2). Included in the figure 
is the expectation from the Monte Carlo simulation for light, c and b quarks. The contributions 
from the various quark flavours in the Monte Carlo are shown after applying the scale factors 
pi, p c and pb obtained from the fit to the complete data sample. 
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Figure 4: Inputs to the NN for events with at least 3 CST tracks: | Si \ , \ S 2 \ , \ S 3 | and S L . Included 
in the figure is the expectation from the Monte Carlo simulation for light, c and b quarks. The 
contributions from the various quark flavours in the Monte Carlo are shown after applying the 
scale factors pi, p c and pb obtained from the fit to the complete data sample. 
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Figure 5: Inputs to the NN for events with at least 3 CST tracks: of the highest and 

second highest transverse momentum track, iV track and A^, k . Included in the figure is the 
expectation from the Monte Carlo simulation for light, c and b quarks. The contributions from 
the various quark flavours in the Monte Carlo are shown after applying the scale factors pi, p c 
and pb obtained from the fit to the complete data sample. 
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Figure 6: The subtracted distributions for \Si\, \S 2 \, \S 3 \ and S L . Each plot shows the difference 
between the distributions for those events with S 2 > 3 and the corresponding distribution with 
S 2 < —3. Included in the figure is the expectation from the Monte Carlo simulation for light, c 
and b quarks. The contributions from the various quark flavours in the Monte Carlo are shown 
after applying the scale factors pi, p c and pb obtained from the fit to the complete data sample. 
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Figure 7: The subtracted distributions for P^ rack of the highest and second highest transverse 
momentum track, iV track and N^. k . Each plot shows the difference between the distributions 
for those events with S 2 > 3 and the corresponding distribution with S 2 < —3. Included in 
the figure is the expectation from the Monte Carlo simulation for light, c and b quarks. The 
contributions from the various quark flavours in the Monte Carlo are shown after applying the 
scale factors pi, p c and pb obtained from the fit to the complete data sample. 
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Figure 8: The subtracted distributions for P%, rf, logQ 2 , logx. Each plot shows the difference 
between the distributions for those events with S 2 > 3 and the corresponding distribution with 
S2 < —3. Included in the figure is the expectation from the Monte Carlo simulation for light, 
c and b quarks. The contributions from the various quark flavours in the Monte Carlo are 
shown after applying the scale factors pi, p c and p b obtained from the fit to the complete data 
sample. Also shown are the variations in and r] q resulting from uncertainties in the QCD 
model of heavy quark production, as well as the variations in log Q 2 and log x, resulting from 
uncertainties on the input heavy quark structure functions. 
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Figure 9: The output of the NN. Included in the figure is the expectation from the Monte Carlo 
simulation for light, c and b quarks. The contributions from the various quark flavours in the 
Monte Carlo are shown after applying the scale factors pi, p c and p& obtained from the fit to the 
complete data sample. 
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Figure 10: The subtracted distributions of (a) Si and (b) S2 and (c) NN output. Included in the 
figure is the result from the fit to the complete data sample of the Monte Carlo distributions of 
the various quark flavours to obtain the scale factors pi, p c and pb- 
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Figure 11: The measured reduced cross section a cc shown as a function of x for different Q 2 
values. The inner error bars show the statistical error, the outer error bars represent the statistical 
and systematic errors added in quadrature. The HERA II measurements are compared with 
those from HERA I. The measurements obtained from D* mesons by HI [12] using HERA I 
data are also shown. The x values of the HERA I data are shifted for visual clarity. 
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Figure 12: The measured reduced cross section a bb shown as a function of x for different Q 2 
values. The inner error bars show the statistical error, the outer error bars represent the statistical 
and systematic errors added in quadrature. The HERA II measurements are compared with 
those from HERA I. The combined HI data are also shown. The x values of the HERA I and 
HERA II data are shifted for visual clarity. 
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Figure 13: The combined reduced cross section a cc shown as a function of x for different Q 2 
values. The inner error bars show the statistical error, the outer error bars represent the statistical 
and systematic errors added in quadrature. The predictions of QCD calculations are also shown. 



35 



H1 CHARM CROSS SECTION IN DIS 



lo 
o 



TTTTTj 1 I I I llllj 1 I I I I lll| 1 I I I I lll| 

Q 2 = 5 GeV 2 



I I I Mill 1 I I I I 



0.4 
0.2 


0.4 
0.2 


0.4 
0.2 


0.4 
0.2 






Q 2 = 20 GeV 2 




Q 2 = 8.5 GeV 2 



Q 2 = 35 GeV 2 




Q 2 =12GeV 2 



Q 2 = 60 GeV 2 




Q 2 = 120 GeV 2 



Q 2 = 200 GeV 2 



Q 2 = 300 GeV 2 




i i nun 1 1 



Q 2 = 650 GeV 2 1 




i i nun i i 



Q 2 = 2000 GeV 2 



ml I 




10 



-4 



10 



-2 



H1 



i i \ 




• H1 Data 

— H1 PDF 2009 

- - MSTW08 



I I I 




10 



10 



10 



10 



-2 



Figure 14: The combined reduced cross section a cc (as in figure [T3l). The predictions of the 
HI PDF 2009 and MSTW08 NLO QCD fits are also shown. For the HI QCD fit the inner error 
bands show the experimental uncertainty, the middle error bands include the theoretical model 
uncertainties of the fit assumptions, and the outer error band represents the total uncertainty 
including the parameterisation uncertainty. 35 
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Figure 15: The combined reduced cross section a bb shown as a function of x for different Q 2 
values. The inner error bars show the statistical error, the outer error bars represent the statistical 
and systematic errors added in quadrature. The predictions of QCD calculations are also shown. 
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Figure 16: The combined reduced cross section a bb (as in figure [T51). The predictions of the 
HI PDF 2009 and MSTW08 NLO QCD fits are also shown. For the HI QCD fit the inner error 
bands show the experimental uncertainty, the middle error bands include the theoretical model 
uncertainties of the fit assumptions, and the outer error band represents the total uncertainty 
including the parameterisation uncertainty. 
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Figure 17: The combined F 2 CC shown as a function of Q 2 for various x values. The inner error 
bars show the statistical error, the outer error bars represent the statistical and systematic errors 
added in quadrature. The predictions of QCD calculations are also shown. 
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Figure 18: The combined shown as a function of Q 2 for various x values. The inner error 
bars show the statistical error, the outer error bars represent the statistical and systematic errors 
added in quadrature. The predictions of QCD calculations are also shown. Note that some 
points have been interpolated in x for visual clarity. 
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Figure 19: The charm and beauty contributions to the total cross section, f cc and f bb , shown as 
a function of Q 2 for different x values. The inner error bars show the statistical error, the outer 
error bars represent the statistical and systematic errors added in quadrature. A prediction of 
NNLO QCD is also shown. The charm data point at x = 0.005 and Q 2 = 300 GeV 2 has been 
interpolated from x = 0.008. 



